Saddlepoint Approximations in Conditional Inference


Suojin  Wang 

Department  of  Statistical  Science 
Southern  Methodist  University 
Dallas,  TX  75275 

SUMMARY 

Saddlepoint approximations are derived for the conditional cumulative distribution function and
density of $\bar X \mid g(\bar X, \bar Y)$, where $(\bar X, \bar Y)$ is the sample mean of $n$ iid bivariate random variables and
$g(x, y)$ is a nonlinear function of $x$ and $y$. The relative error of order $O(n^{-1})$ is retained. Such
approximations are useful in conditional inference, especially in the case of small or moderate sample
sizes. Generalizations to higher dimensional random vectors are also discussed. Two examples are
demonstrated and some numerical results are provided to show the extraordinary accuracy of the new
approximations.


Key words: Asymptotic expansion; Conditional density; Conditional distribution; Conditional
inference; Nonlinear conditioning; Saddlepoint approximation.

1. INTRODUCTION

Conditional inference plays an important role in statistical inference. The conditionality principle
has been used to deal with various problems. Recent developments on this issue include important
contributions by Barndorff-Nielsen and Reid (1987), Fraser and Reid (1988), DiCiccio,
Field and Fraser (1990) and others. One major step in the procedure of conditional inference is to
obtain the conditional distribution functions. As is often the case, the exact conditional distributions
are difficult or impossible to obtain, and conventional approximations may often fail to work. For
example, it is generally hard to calculate the moments of the conditional distributions, which are
necessary quantities for the Edgeworth approximations. Furthermore, these approximations are often
unsatisfactory for small or moderate sample sizes.

On the other hand, it is well known that saddlepoint expansions lead to accurate approximations,
even for small sample sizes. Among other papers, Barndorff-Nielsen and Cox (1979) and Reid (1988)
have discussed the importance and usefulness of the saddlepoint approximations in statistics. Davison
and Hinkley (1988) apply the saddlepoint method (see Daniels (1987)) to resampling problems. Using
the saddlepoint technique twice, Skovgaard (1987) has derived an accurate approximation to the
conditional distribution of a sample mean given a $p - 1$ dimensional linear function of a $p$ dimensional
variable, which has been applied by Davison (1988) and Davison and Hinkley (1988) to generalized
linear models and resampling analysis. Wang (1990b) derives saddlepoint formulas for bivariate
distribution functions. Still, most applications are limited to the sample mean and other
simple linear statistics due to the strong requirements of the saddlepoint technique on the moment
generating function of the statistics under consideration.

However,  consideration  of  more  general  statistics  is  often  desired.  In  particular,  conditional 
inference  often  requires  a  distribution  function  conditioning  on  nonlinear  functions  of  a  sample  mean  of 
multivariate  random  variables.  Recent  work  on  saddlepoint  expansions  includes  expansions  for  some 
specific  nonlinear  statistics  by  Srivastava  and  Yau  (1989)  and  Wang  (1990c). 

In this paper we derive accurate saddlepoint expansions for the case of nonlinear conditioning.
The results include Skovgaard's (1987) method as a special case when the distribution is continuous,
but have much broader applications. Sections 2 and 3 derive saddlepoint formulas for the conditional
density and conditional distribution function, respectively. Two examples are considered in Section 4
to illustrate the use of the new results. Extraordinary accuracy is also shown numerically.


2. SADDLEPOINT EXPANSIONS FOR THE CONDITIONAL DENSITY


We consider first the bivariate case, and generalize results to the $p$ $(> 2)$ dimensional case at the
end of Section 3. Assume that $(X_1, Y_1), \ldots, (X_n, Y_n)$ are independent identically distributed
bivariate continuous random variables with the cumulant generating function $K(u, v)$ existing in a
neighborhood of $(0, 0)$. Let

\[ T = c_1 \bar X + c_2 \bar Y \]

and let $Z = g(\bar X, \bar Y)$ be a bivariate function on the sample space, where $c_1$ and $c_2$ are given constants and $\bar X$
and $\bar Y$ are the sample means of the $X_i$'s and $Y_i$'s, respectively. We wish to expand the conditional
distribution $H_n(t_0 \mid z) = \Pr(T \le t_0 \mid Z = z)$ and the corresponding density. Without loss of generality,
let $T = \bar X$ and assume that the transformation from $(\bar X, \bar Y)$ to $(T, Z)$ is one to one. Then the inverse
transform can be written as $(\bar X, \bar Y) = (T, q(T, Z))$ for some function $q$. When the transform is not one
to one, the domain should be partitioned so that in each portion the transform is one to one.

In this section we consider the conditional density. It is well known that the density $p_n(x, y)$ of
$(\bar X, \bar Y)$ has the following saddlepoint approximation (equation (1) of Reid (1988)):

\[ p_n(x, y) = p_s(x, y)\,\{1 + O(n^{-1})\}, \tag{1} \]

where

\[ p_s(x, y) = \frac{n}{2\pi\{D(\hat u, \hat v)\}^{1/2}} \exp\{n[K(\hat u, \hat v) - \hat u x - \hat v y]\}, \tag{2} \]

$D(u, v) = K_{uu}K_{vv} - (K_{uv})^2 \equiv |K''(u, v)|$ is the determinant of the $2 \times 2$ matrix of the
second partial derivatives of $K$, and $(\hat u, \hat v)$ is the saddlepoint defined by

\[ K_u(\hat u, \hat v) = x, \qquad K_v(\hat u, \hat v) = y. \tag{3} \]
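As a numerical illustration of (2) and (3), the following minimal Python sketch solves the saddlepoint equations by Newton's method and evaluates $p_s(x, y)$. It is a sketch under stated assumptions, not a definitive implementation: the cumulant generating function is the one used in the second example of Section 4, the function names (`K`, `K_derivs`, `solve_saddlepoint`, `p_s`) are hypothetical, and the step-halving safeguard is an implementation choice rather than part of the method.

```python
import math

def K(u, v):
    """CGF of (W^2, W) for W ~ N(1, 1); assumed here, borrowed from Section 4."""
    return (u + v + v * v / 2.0) / (1.0 - 2.0 * u) - 0.5 * math.log(1.0 - 2.0 * u)

def K_derivs(u, v):
    """Return (K_u, K_v, K_uu, K_uv, K_vv) for the CGF above."""
    s = 1.0 / (1.0 - 2.0 * u)
    a = 1.0 + v
    return (a * a * s * s + s, a * s,
            4.0 * a * a * s ** 3 + 2.0 * s * s, 2.0 * a * s * s, s)

def solve_saddlepoint(x, y, tol=1e-12, max_iter=100):
    """Solve K_u = x, K_v = y (equation (3)) by a safeguarded Newton iteration."""
    u, v = 0.0, 0.0
    for _ in range(max_iter):
        ku, kv, kuu, kuv, kvv = K_derivs(u, v)
        r1, r2 = ku - x, kv - y
        if abs(r1) + abs(r2) < tol:
            break
        det = kuu * kvv - kuv * kuv
        # Newton step: minus the inverse Hessian times the residual.
        du = -(kvv * r1 - kuv * r2) / det
        dv = -(-kuv * r1 + kuu * r2) / det
        step = 1.0
        while u + step * du >= 0.5:   # stay inside the domain u < 1/2
            step *= 0.5
        u, v = u + step * du, v + step * dv
    return u, v

def p_s(x, y, n):
    """Saddlepoint density (2) of the bivariate sample mean at (x, y)."""
    u, v = solve_saddlepoint(x, y)
    _, _, kuu, kuv, kvv = K_derivs(u, v)
    D = kuu * kvv - kuv * kuv
    return n / (2.0 * math.pi * math.sqrt(D)) * math.exp(n * (K(u, v) - u * x - v * y))
```

For this particular $K$ the saddlepoint is available in closed form (Section 4), so the numerical solution at, say, $(x, y) = (1.5, 0.6)$ can be compared with it directly.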


Here we adopt Skovgaard's notation for partial derivatives, e.g., $K_{uu}$ is the second partial derivative
of $K$ with respect to $u$, etc. The relative error in (1) is of order $n^{-1}$ uniformly over compact sets. In
this paper we will use "$\simeq$" to denote an approximation having such an error. Letting $(x, y) = (t, q(t, z))$
and still using $(\hat u, \hat v)$ to denote the corresponding solution in (3), it is easily seen from (1) that the
density $h_n(t, z)$ of $(T, Z)$ can be expressed as


\[ h_n(t, z) = p_n(t, q(t, z))\,|J| \simeq h_s(t, z), \tag{4} \]

where

\[ h_s(t, z) = \frac{n\,|J|}{2\pi\{D(\hat u, \hat v)\}^{1/2}} \exp\{n[K(\hat u, \hat v) - \hat u t - \hat v\, q(t, z)]\}, \]

and $J = q_z(t, z) \equiv \{g_y(x, y)\}^{-1}$ is the Jacobian factor. To approximate the conditional density
$h_n(t \mid z)$ of $T \mid (Z = z)$, it is sufficient to obtain an approximation to the marginal density $h_n(z)$ for $Z$.
By (4),

\[ h_n(z) = \int_{-\infty}^{\infty} h_n(t, z)\,dt \simeq \int_{-\infty}^{\infty} h_s(t, z)\,dt. \tag{5} \]


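Let

\[ L(t) = K(\hat u, \hat v) - \hat u t - \hat v\, q(t, z), \tag{6} \]

where $z$ is fixed so that $\hat u$ and $\hat v$ depend on $t$ only. Then by (3),

\[ L'(t) = -\hat u - \hat v\, q_t(t, z) \tag{7} \]

and

\[ L''(t) = \{-K_{vv}(\hat u, \hat v) + 2 q_t(t, z) K_{uv}(\hat u, \hat v) - [q_t(t, z)]^2 K_{uu}(\hat u, \hat v)\}/D(\hat u, \hat v) - \hat v\, q_{tt}(t, z), \tag{8} \]

since by taking the derivative with respect to $t$ of both sides of (3) when $(x, y) = (t, q(t, z))$ we have

\[ \hat u_t = \{K_{vv}(\hat u, \hat v) - q_t(t, z) K_{uv}(\hat u, \hat v)\}/D(\hat u, \hat v), \]
\[ \hat v_t = \{-K_{uv}(\hat u, \hat v) + q_t(t, z) K_{uu}(\hat u, \hat v)\}/D(\hat u, \hat v). \]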
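Formulas (7) and (8) can be checked numerically. The sketch below is an illustration only: it borrows the cumulant generating function and the inverse map $q(t, z) = z\sqrt t$ from the second example of Section 4 (where the saddlepoint has a closed form) and compares (7) and (8) with central finite differences of $L(t)$. The helper names and the step size `h` are assumptions made for the illustration.

```python
import math

def saddle(t, z):
    """Closed-form saddlepoint at (x, y) = (t, z*sqrt(t)) for K below (Section 4)."""
    s = (1.0 - z * z) * t
    return 0.5 - 0.5 / s, z * math.sqrt(t) / s - 1.0

def K(u, v):
    """CGF of (W^2, W) for W ~ N(1, 1); assumed, borrowed from Section 4."""
    return (u + v + v * v / 2.0) / (1.0 - 2.0 * u) - 0.5 * math.log(1.0 - 2.0 * u)

def L(t, z):
    """Equation (6) with q(t, z) = z*sqrt(t)."""
    u, v = saddle(t, z)
    return K(u, v) - u * t - v * z * math.sqrt(t)

def L1_formula(t, z):
    """Equation (7): L'(t) = -u - v * q_t."""
    u, v = saddle(t, z)
    q_t = z / (2.0 * math.sqrt(t))
    return -u - v * q_t

def L2_formula(t, z):
    """Equation (8), with K_uu, K_uv, K_vv evaluated at the saddlepoint."""
    u, v = saddle(t, z)
    s = 1.0 / (1.0 - 2.0 * u)
    a = 1.0 + v
    kuu, kuv, kvv = 4.0 * a * a * s ** 3 + 2.0 * s * s, 2.0 * a * s * s, s
    D = kuu * kvv - kuv * kuv
    q_t = z / (2.0 * math.sqrt(t))
    q_tt = -z / (4.0 * t ** 1.5)
    return (-kvv + 2.0 * q_t * kuv - q_t * q_t * kuu) / D - v * q_tt

def central_diffs(t, z, h=1e-4):
    """Central finite-difference estimates of L'(t) and L''(t)."""
    f0, fp, fm = L(t, z), L(t + h, z), L(t - h, z)
    return (fp - fm) / (2.0 * h), (fp - 2.0 * f0 + fm) / (h * h)
```

Agreement between `central_diffs` and the two formula functions at a few points is a quick sanity check on the signs in (8), which are easy to get wrong by hand.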


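We now assume that there exists a unique solution, $t_1$, to

\[ L'(t) = 0 \tag{9} \]

and $L''(t_1) < 0$. Following the technique of Bleistein (1966), we transform $t$ to $w$ such that

\[ (w - w_1)^2/2 = L(t_1) - L(t), \tag{10} \]

where $w_1 = \mathrm{sgn}(t_1)\{2[L(t_1) - L(0)]\}^{1/2}$ and $\mathrm{sgn}(w - w_1) = \mathrm{sgn}(t - t_1)$. Note that $w = 0$ and $w = w_1$
correspond to $t = 0$ and $t = t_1$, respectively. From (4), (5) and (10), it is seen that

\[ h_n(z) \simeq \frac{n}{2\pi} \exp\{nL(t_1)\} \int_{-\infty}^{\infty} \exp\{-n(w - w_1)^2/2\}\, \frac{|J|}{\{D(\hat u, \hat v)\}^{1/2}}\, \frac{dt}{dw}\, dw
 \simeq \Big(\frac{n}{2\pi}\Big)^{1/2} \exp\{nL(t_1)\} \left. \frac{|J|}{\{D(\hat u, \hat v)\}^{1/2}}\, \frac{dt}{dw} \right|_{w = w_1}. \]

This last approximation is obtained by applying Watson's lemma (Jeffreys and Jeffreys (1962)). But
by differentiating both sides of (10), we have

\[ \frac{dt}{dw} = \frac{w_1 - w}{L'(t)}, \tag{11} \]

which tends to $\{-L''(t_1)\}^{-1/2}$ as $w \to w_1$. Thus,

\[ h_n(z) \simeq |J_1| \left\{ \frac{n}{-2\pi L''(t_1) D(\hat u_1, \hat v_1)} \right\}^{1/2} \exp\{nL(t_1)\}, \tag{12} \]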


where $(\hat u_1, \hat v_1)$ is the solution to (3) when $(x, y) = (t_1, q(t_1, z))$, and $J_1 = q_z(t_1, z)$ is the Jacobian
factor at $t = t_1$. Noticing that the conditional density is given by

\[ h_n(t \mid z) = h_n(t, z)/h_n(z), \tag{13} \]

from (4), (6) and (12) we have reached the following result.


Theorem 1. Let $T = \bar X$, $Z = g(\bar X, \bar Y)$. Assume that the general regularity conditions described at the
beginning of this section hold and that the solution $t_1$ in (9) exists. Denote the inverse transform by
$(\bar X, \bar Y) = (T, q(T, Z))$. Then the conditional density of $T$ given $Z = z$ has the saddlepoint expansion

\[ h_n(t \mid z) \simeq \left| \frac{J}{J_1} \right| \left\{ \frac{-n L''(t_1) D(\hat u_1, \hat v_1)}{2\pi D(\hat u, \hat v)} \right\}^{1/2} \exp\{n[L(t) - L(t_1)]\}, \tag{14} \]

where $(\hat u, \hat v)$ and $(\hat u_1, \hat v_1)$ are the solutions to (3) when $(x, y) = (t, q(t, z))$ and $(t_1, q(t_1, z))$,
respectively, and $J$ and $J_1$ are the Jacobian factors at $t$ and $t_1$, respectively.

As is seen in the next section, the technique used to derive (12) can be modified for the purpose of
approximating the conditional distribution function. Expansion (12) is useful in the derivations.

3.  SADDLEPOINT  EXPANSION  FOR  THE  CONDITIONAL  DISTRIBUTION 
We  begin  with  the  following  lemma  which  will  be  needed  in  this  section. 


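Lemma 1. Assume that $\alpha(x)$ is an analytic function. Then

\[ \Big(\frac{n}{2\pi}\Big)^{1/2} \int_{-\infty}^{y} \alpha(x) \exp\{-n x^2/2\}\,dx \simeq \alpha(0)\,\Phi(\sqrt n\, y) - \frac{\alpha(y) - \alpha(0)}{\sqrt n\, y}\,\phi(\sqrt n\, y), \tag{15} \]

where $\phi$ and $\Phi$ are the standard normal density and cumulative distribution function, respectively.
When $y = 0$, the expansion is $\alpha(0)/2 - \alpha'(0)/(2\pi n)^{1/2}$.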


This lemma is a special case of Temme (1982); see Temme (1982) for the proof.
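To get a feel for the size of the error in (15), the following Python sketch evaluates both sides for one test function. The choice $\alpha(x) = e^x$ is an assumption made purely for illustration, because the left-hand side then has a closed form by completing the square, namely $e^{1/(2n)}\,\Phi(\sqrt n\,(y - 1/n))$; the function names are hypothetical.

```python
import math

def Phi(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def phi(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def lemma1_rhs(alpha, y, n):
    """Right-hand side of (15); y must be nonzero."""
    r = math.sqrt(n) * y
    return alpha(0.0) * Phi(r) - (alpha(y) - alpha(0.0)) / r * phi(r)

def lhs_exact_exp(y, n):
    """Left-hand side of (15) for alpha(x) = exp(x), by completing the square."""
    return math.exp(0.5 / n) * Phi(math.sqrt(n) * (y - 1.0 / n))

def relative_error(y, n):
    """Relative error of the approximation (15) for alpha(x) = exp(x)."""
    exact = lhs_exact_exp(y, n)
    return abs(lemma1_rhs(math.exp, y, n) - exact) / exact
```

For this test function the relative error at $y = 0.3$ should shrink roughly in proportion to $1/n$ as $n$ grows, in line with the meaning of "$\simeq$" used in this paper.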
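By (4) and (6), the conditional distribution function is

\[ H_n(t_0 \mid z) = \int_{-\infty}^{t_0} h_n(t \mid z)\,dt \simeq \frac{n}{2\pi h_n(z)} \int_{-\infty}^{t_0} \frac{|J|}{\{D(\hat u, \hat v)\}^{1/2}} \exp\{nL(t)\}\,dt. \tag{16} \]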


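Using the same transformation as in (10) and denoting by $w_0$ the transformed value of $t$ evaluated at
$t_0$, we have

\[ H_n(t_0 \mid z) \simeq \frac{n \exp\{nL(t_1)\}}{2\pi h_n(z)} \int_{-\infty}^{w_0} c(w) \exp\{-n(w - w_1)^2/2\}\,dw, \]

where $c(w) = |J|\,(dt/dw)/\{D(\hat u, \hat v)\}^{1/2}$. Therefore by Lemma 1 and formulas (11) and (12), it is easily
seen that

\[ H_n(t_0 \mid z) \simeq \Phi(a_0) + \frac{c(w_1) - c(w_0)}{c(w_1)\,a_0}\,\phi(a_0) = \Phi(a_0) + \{a_0^{-1} - b_0^{-1}\}\,\phi(a_0), \tag{17} \]

where

\[ a_0 = \sqrt n\,(w_0 - w_1) = \mathrm{sgn}(t_0 - t_1)\,\{2n[L(t_1) - L(t_0)]\}^{1/2}, \]

\[ b_0 = a_0\,\frac{c(w_1)}{c(w_0)} = -L'(t_0) \left| \frac{J_1}{J_0} \right| \left\{ \frac{n D(\hat u_0, \hat v_0)}{-L''(t_1) D(\hat u_1, \hat v_1)} \right\}^{1/2}, \]

$(\hat u_0, \hat v_0)$ is the solution to (3) when $(x, y) = (t_0, q(t_0, z))$, and $J_0$ and $J_1$ are the Jacobian factors at $t_0$ and $t_1$, respectively. We summarize the above results in
the following theorem.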


Theorem 2. Under the conditions in Theorem 1, the conditional distribution function $H_n(t_0 \mid z) = \Pr(T \le t_0 \mid Z = z)$
has the saddlepoint expansion (17). When $t_0 = t_1$, the expansion can be obtained by
taking the limit of (17) as $t_0 \to t_1$.


Notice that formula (17) has the same form as Lugannani and Rice's (1980) formula, as well as
those of Skovgaard (1987) and Wang (1990c). Note also that because of the special structure of the
saddlepoint technique the expansions (4), (12), (14) and (17) with relative error of order $n^{-1}$ are
sufficiently accurate for most applications. Higher order expansions could be obtained by referring to
Temme (1982), although the algebra is complicated.

It is straightforward to generalize expansions (14) and (17) to the $p$ $(> 2)$ dimensional case with
an analog of Skovgaard (1987). Let $(X_i, Y_i)$, $i = 1, \ldots, n$, be iid continuous $p$ dimensional random
vectors. Assume that $(T, Z) = (\bar X, g(\bar X, \bar Y))$ is a one to one transform, where $Z = g(\bar X, \bar Y)$ is a $p - 1$
dimensional vector, and that expansions for $H_n(t_0 \mid z) = \Pr(T \le t_0 \mid Z = z)$ and the density $h_n(t_0 \mid z)$
are desired. Then the inverse transform $(\bar X, \bar Y) = (T, q(T, Z))$ exists and the variable $v$ in the
cumulant generating function $K(u, v)$ of $(X_i, Y_i)$ is $p - 1$ dimensional. Analogously to (6), let

\[ L(t) = K(\hat u, \hat v) - \hat u t - \hat v^{\tau} q(t, z), \]

where $\hat v^{\tau}$ is the transpose of $\hat v$, and let $J = \partial(x, y)/\partial(t, z)$ be the Jacobian factor and $D(u, v)$ be the
determinant of the $p \times p$ matrix of the second partial derivatives of $K(u, v)$. Following the same
procedures, it is easily seen that (14) and (17) also hold for the general $p$ dimensional case.


4.  EXAMPLES 

In this section we consider two applications. The first one is about the conditional density and
conditional distribution of $\bar X$ given the sample variance $\frac{1}{n}\sum (X_i - \bar X)^2$. It is equivalent to consider
$\bar X \mid (\frac{1}{n}\sum X_i^2 - (\bar X)^2)$. Let $T = \bar X$, $\bar Y = \frac{1}{n}\sum X_i^2$ and $Z = g(\bar X, \bar Y) = \bar Y - (\bar X)^2$. Then the problem of
approximating $H_n(t_0 \mid z) = \Pr(T \le t_0 \mid Z = z)$ and its density $h_n(t \mid z)$ can be handled well by the new
results developed in this paper, provided that the general conditions in Section 2 are satisfied. In
particular, when the $X_i$'s are normally distributed, say from $N(0, 1)$, it is easily calculated that the
saddlepoint expansions (14) and (17) are $h_s(t \mid z) = \sqrt n\,\phi(\sqrt n\, t)$ and $H_s(t_0 \mid z) = \Phi(\sqrt n\, t_0)$, independent
of $z$. That is, the approximations become exact in this case.
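The calculation behind the normal case can be sketched as follows. This is a worked derivation under the stated assumption $X_i \sim N(0, 1)$, $Y_i = X_i^2$, using the standard joint cumulant generating function of $(X_i, X_i^2)$ and writing $b_0 = -L'(t_0)\,|J_1/J_0|\,\{n D(\hat u_0, \hat v_0)/[-L''(t_1) D(\hat u_1, \hat v_1)]\}^{1/2}$ for the second quantity entering (17); the intermediate steps are not spelled out in the text and are supplied here for illustration.

```latex
% Assumed setting: X_i ~ N(0,1), Y_i = X_i^2, z = g(x,y) = y - x^2 > 0.
\begin{align*}
K(u, v) &= \frac{u^2}{2(1 - 2v)} - \tfrac12 \log(1 - 2v), \qquad
  q(t, z) = z + t^2, \qquad J = q_z = 1, \\
% Saddlepoint equations (3) at (x, y) = (t, z + t^2):
K_u &= \frac{\hat u}{1 - 2\hat v} = t, \quad
K_v = \frac{1}{1 - 2\hat v} + \frac{\hat u^2}{(1 - 2\hat v)^2} = z + t^2
  \;\Longrightarrow\;
  \hat u = \frac{t}{z}, \quad \hat v = \frac12 - \frac{1}{2z}, \\
% Equation (6):
L(t) &= -\frac{t^2}{2} + \tfrac12 \log z - \frac{z}{2} + \frac12, \qquad
  L'(t) = -t, \qquad t_1 = 0, \qquad L''(t) = -1, \\
% Determinant of the Hessian at the saddlepoint (free of t):
D(\hat u, \hat v) &= z\,(2z^2 + 4t^2 z) - (2tz)^2 = 2z^3, \\
% Expansion (14):
h_s(t \mid z) &= \Big\{\frac{n}{2\pi}\Big\}^{1/2} \exp\{-n t^2/2\}
  = \sqrt n\, \phi(\sqrt n\, t), \\
% Expansion (17):
a_0 &= \mathrm{sgn}(t_0)\,\{n t_0^2\}^{1/2} = \sqrt n\, t_0, \qquad
b_0 = t_0 \sqrt n = a_0
  \;\Longrightarrow\; H_s(t_0 \mid z) = \Phi(\sqrt n\, t_0).
\end{align*}
```

Since $a_0 = b_0$, the correction term in (17) vanishes identically, which is why the approximation coincides with the exact normal distribution of $\bar X$.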

We shall discuss the second example in more detail. In an interesting paper, Hinkley
(1977) posed the following problem. Let $W_1, \ldots, W_n$ be independently $N(\mu, c^2\mu^2)$, where $c$ is a
known constant and $\mu > 0$ is the parameter of interest. For simplicity, let $c = 1$. We may take as an
ancillary statistic

\[ Z = \bar W \Big/ \Big( \frac{1}{n} \sum W_i^2 \Big)^{1/2} \tag{18} \]

as pointed out by Hinkley. By the conditionality principle (Cox and Hinkley (1974), Sec. 2.2),
inference about $\mu$ is based on the conditional distribution of $T = \frac{1}{n}\sum W_i^2$ given the observed value of $Z$.
A major step in the problem is to obtain $H_n(t_0 \mid z)$ and $h_n(t_0 \mid z)$ or sufficiently accurate approximations
to them. Expansions (14) and (17) serve this purpose. To illustrate, let the true value $\mu = 1$ and let
$X_i = W_i^2$ and $Y_i = W_i$. Thus by (18), $z = g(x, y) = y/\sqrt x$ and $y = q(t, z) = z\sqrt t$. It is easily obtained
that the cumulant generating function of $(X_i, Y_i)$ is

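\[ K(u, v) = \frac{u + v + v^2/2}{1 - 2u} - \frac12 \log(1 - 2u). \]

For $(x, y) = (t, z\sqrt t)$, the solution to (3) is

\[ (\hat u, \hat v) = \Big( \frac12 - \frac{1}{2(1 - z^2)t},\ \frac{z}{(1 - z^2)\sqrt t} - 1 \Big). \]

From (6), some algebra leads to

\[ L(t) = z\sqrt t - \frac{t}{2} + \frac12 \log t + \frac12 \log(1 - z^2). \]

Note that $|z| < 1$. Hence the solution $t_1$ to $L'(t) = 0$ is given by

\[ t_1 = (z^2 + 2 + z\sqrt{z^2 + 4})/2, \]

and the condition

\[ L''(t) = -z\,t^{-3/2}/4 - t^{-2}/2 < 0 \]

holds at $t = t_1$ for every $|z| < 1$, and for all $t$ in the domain $(0, \infty)$ when $z \ge 0$. Moreover, one can obtain that

\[ D(\hat u, \hat v) = 2\{(1 - z^2)t\}^3 \quad \text{and} \quad |J| = \sqrt t. \]

Therefore, expansion (14) reduces to

\[ h_n(t \mid z) \simeq c_n\, t^{n/2 - 1} \exp\{-n(\sqrt t - z)^2/2\}, \tag{20} \]

where

\[ c_n = \Big\{ \frac{n(z\sqrt{t_1} + 2)}{8\pi} \Big\}^{1/2} t_1^{-n/2} \exp\{n(\sqrt{t_1} - z)^2/2\}. \]

Since the exact density $h_n(t \mid z)$ is proportional to $t^{n/2 - 1}\exp\{-n(\sqrt t - z)^2/2\}$ (see Hinkley (1977)),
(20) is viewed as "exact" if the renormalization is allowed. Using the above calculations, expansion
(17) for $H_n(t_0 \mid z)$ is easily computed with

\[ a_0 = \mathrm{sgn}(t_0 - t_1)\,\{n[(\sqrt{t_0} - \sqrt{t_1})(\sqrt{t_0} + \sqrt{t_1} - 2z) + \log(t_1/t_0)]\}^{1/2}, \]

\[ b_0 = (t_0 - z\sqrt{t_0} - 1)\,\{n/(z\sqrt{t_1} + 2)\}^{1/2}. \]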
Table 1 compares the saddlepoint approximation (17) with the true conditional distribution
function for $z = 0.5$ and $n = 5, 10$. The extraordinary accuracy of the approximation is evident
uniformly over the domain of the statistic. Calculations also show that the relative error is not
affected by the value of $z$, i.e., the numerical accuracy of the expansion (17) is stable for all $|z| < 1$.
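The comparison reported in Table 1 can be sketched in a few lines of Python. This is an illustrative sketch, not the code used for the paper: `saddlepoint_cdf` evaluates (17) as $\Phi(a_0) + \{a_0^{-1} - b_0^{-1}\}\phi(a_0)$ with $a_0 = \mathrm{sgn}(t_0 - t_1)\{n[(\sqrt{t_0} - \sqrt{t_1})(\sqrt{t_0} + \sqrt{t_1} - 2z) + \log(t_1/t_0)]\}^{1/2}$ and $b_0 = (t_0 - z\sqrt{t_0} - 1)\{n/(z\sqrt{t_1} + 2)\}^{1/2}$, while `exact_cdf` normalizes the density proportional to $t^{n/2-1}\exp\{-n(\sqrt t - z)^2/2\}$ numerically. The function names, the use of Simpson's rule, the number of subintervals `m` and the truncation point of the upper tail are all assumptions made for the illustration.

```python
import math

def Phi(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def phi(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def saddlepoint_cdf(t0, z, n):
    """Expansion (17) for the second example, using closed-form a_0 and b_0."""
    r1 = (z + math.sqrt(z * z + 4.0)) / 2.0      # sqrt(t_1)
    t1 = r1 * r1
    r0 = math.sqrt(t0)
    if abs(t0 - t1) < 1e-6:
        raise ValueError("t0 is too close to t1: take the limit of (17) instead")
    a0 = math.copysign(
        math.sqrt(n * ((r0 - r1) * (r0 + r1 - 2.0 * z) + math.log(t1 / t0))),
        t0 - t1)
    b0 = (t0 - z * r0 - 1.0) * math.sqrt(n / (z * r1 + 2.0))
    return Phi(a0) + (1.0 / a0 - 1.0 / b0) * phi(a0)

def exact_cdf(t0, z, n, m=4000):
    """CDF of the density proportional to t^(n/2-1) exp{-n (sqrt(t) - z)^2 / 2},
    normalized numerically after the substitution s = sqrt(t)."""
    def f(s):
        return s ** (n - 1) * math.exp(-0.5 * n * (s - z) ** 2)

    def simpson(a, b):
        # Composite Simpson rule with m (even) subintervals.
        h = (b - a) / m
        total = f(a) + f(b)
        for i in range(1, m):
            total += (4.0 if i % 2 else 2.0) * f(a + i * h)
        return total * h / 3.0

    s0 = math.sqrt(t0)
    upper = max(s0, z + 12.0 / math.sqrt(n))   # mass beyond this point is negligible
    return simpson(0.0, s0) / simpson(0.0, upper)
```

With $z = 0.5$, evaluating `saddlepoint_cdf(1.0, 0.5, 5)` and `exact_cdf(1.0, 0.5, 5)` should reproduce the corresponding row of Table 1 to the printed precision.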


5.  CONCLUDING  REMARKS 

In this paper we have derived genuine saddlepoint expansions for conditional densities and
distributions with nonlinear conditioning. The resulting formulas are highly accurate even for small
sample sizes and are easy to use. Formula (17) reduces to Skovgaard's (1987) formula when the
conditioning variable $Z = g(\bar X, \bar Y)$ is a linear function of $\bar X$ and $\bar Y$.

Resampling methods such as the bootstrap are very useful in conditional inference (see, for example,
Hinkley (1988)). However, computational difficulty is generally a major obstacle in such approaches,
evidently more so than in classical parametric problems. Besides their direct use in parametric
applications, the new approximations provide a possible remedy to overcome the difficulties in many
such cases. Further investigations are needed to draw a solid conclusion on this issue. We have
assumed continuity of the statistics in our derivations while those in the resampling schemes are
discrete, although they become smooth as the sample size increases. There is now an interesting
question as to whether the formulas derived here or their modified versions are valid for such purposes.
A detailed examination of this question is beyond the scope of this paper. Recent work by Wang
(1990a) on the validity of saddlepoint expansions in the problem of bootstrapping a sample mean is
relevant.
Table 1. Saddlepoint approximation (17) to the conditional distribution $H_n(t_0 \mid z)$
in the second example in Section 4; $z = 0.5$.

              n = 5                            n = 10
   t_0    saddlepoint    true         t_0    saddlepoint    true
   0.03     .00004      .00004        0.15     .00002      .00002
   0.05     .00016      .00016        0.20     .00011      .00011
   0.10     .00103      .00102        0.40     .00351      .00351
   0.30     .01803      .01803        0.60     .02220      .02223
   0.50     .06295      .06306        0.80     .07111      .07119
   0.70     .13418      .13446        1.00     .15644      .15660
   1.00     .27298      .27354        1.20     .27179      .27202
   1.40     .47269      .47348        1.50     .46870      .46900
   1.80     .64674      .64754        1.80     .65067      .65097
   2.20     .77748      .77815        2.10     .79010      .79034
   2.80     .89815      .898u4        2.40     .88318      .88335
   3.30     .95012      .95036        2.80     .95160      .95169
   4.00     .98293      .98303        3.30     .98588      .98591
   5.50     .99860      .99861        4.20     .99884      .99884
   6.50     .99976      .99977        4.70     .99974      .99974
   7.50     .99996      .99996        5.40     .99997      .99997